Abstract



A classical problem in phylogenetic tree analysis is to decide whether
there is a phylogenetic tree T that contains all information of a given
collection V of phylogenetic trees. If the answer is "yes", we say that V
is compatible and T displays V. This decision problem is NP-complete
even if all input trees are quartets, that is, binary trees with exactly
four leaves. In this paper, we prove a sufficient condition for a set
of binary phylogenetic trees to be compatible. That result is used to
give a short and self-contained proof of the known characterization of
quartet sets of minimal cardinality which are displayed by a unique
phylogenetic tree.

Keywords: Phylogenetics, Supertrees, Quartets 



1 Introduction 

Phylogenetic trees are used in computational biology to visualize the
evolutionary relationship between some taxa (e.g. species or genes). For many
methods in phylogenetic tree reconstruction, the required running time grows
at least exponentially with the number of taxa. Hence, it is a natural and
widely used approach to first construct a collection of phylogenetic trees with
small taxa sets and then a supertree (see [1] for an overview), that is, a
phylogenetic tree for all taxa that are contained in at least one of the small
trees. However, it is NP-complete to decide whether there is a supertree that
displays all input trees, even if each of them contains only four taxa [H].

Mathematically, a phylogenetic tree is a tree without vertices of degree 2
and it is called binary if every interior vertex has degree 3. The set of leaves
or taxa set of a phylogenetic tree $T$ is denoted by $L(T)$ and, for a set $V$
of phylogenetic trees, we define $L(V) = \bigcup_{T \in V} L(T)$. Let $T$ and
$T'$ be two phylogenetic trees with $L(T') \subseteq L(T)$ where $T'$ is binary.
We say that $T'$ is displayed by $T$ if $T'$ is the phylogenetic tree obtained
from the smallest subtree of $T$ that contains $L(T')$ by suppressing all
vertices of degree 2. In that case $T'$ is called the restriction of $T$ to
$L(T')$ and is denoted by $T|_{L(T')}$.
A set $V$ of binary phylogenetic trees is compatible if there is a phylogenetic
tree $T$ that displays every tree in $V$, and in that case we say that $T$
displays $V$. A cherry of a phylogenetic tree is a set of two leaves which are
adjacent to the same vertex of degree 3. A quartet is a binary phylogenetic tree
with exactly four leaves, and the quartet with leaf set $\{a, b, c, d\}$ and
cherries $\{a, b\}$ and $\{c, d\}$ is denoted by $ab|cd$. A set $V$ of binary
phylogenetic trees is definitive if there is exactly one phylogenetic tree $T$
with taxa set $L(V)$ displaying $V$, and in that case we say that $V$ defines
$T$. Here, two phylogenetic trees $T_1$ and $T_2$ are defined to be the same (or
isomorphic) if $L(T_1) = L(T_2)$ and there is a graph isomorphism from $T_1$ to
$T_2$ that maps every leaf to itself. For example, both phylogenetic trees
depicted in Figure 1 display the quartets $ad|cf$, $ae|bf$, and $bd|ce$; thus
the set $\{ad|cf, ae|bf, bd|ce\}$ is not definitive.
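The notion of a tree displaying a quartet can be made concrete with a small Python sketch. It relies on the standard observation that a tree displays ab|cd exactly when the path from a to b and the path from c to d share no vertex. The adjacency-dictionary representation, the helper names `make_tree`, `tree_path` and `displays_quartet`, and the example tree are assumptions made for this illustration; the tree is hypothetical and is not one of the trees of Figure 1.

```python
from collections import deque

def make_tree(edges):
    """Build an adjacency dictionary from a list of undirected edges."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj

def tree_path(adj, s, t):
    """Vertices on the unique path from s to t in a tree (breadth-first search)."""
    parent = {s: None}
    queue = deque([s])
    while queue:
        v = queue.popleft()
        for w in adj[v]:
            if w not in parent:
                parent[w] = v
                queue.append(w)
    path = [t]
    while parent[path[-1]] is not None:
        path.append(parent[path[-1]])
    return path[::-1]

def displays_quartet(adj, a, b, c, d):
    """True if the tree displays ab|cd: the a-b and c-d paths share no vertex."""
    return not (set(tree_path(adj, a, b)) & set(tree_path(adj, c, d)))

# Hypothetical binary tree with leaves a..f and interior vertices p, q, r, s.
tree = make_tree([("a", "p"), ("b", "p"), ("p", "q"), ("c", "q"),
                  ("q", "r"), ("d", "r"), ("r", "s"), ("e", "s"), ("f", "s")])

print(displays_quartet(tree, "a", "b", "c", "d"))  # query for ab|cd
print(displays_quartet(tree, "a", "c", "b", "d"))  # query for ac|bd
```

Checking a whole quartet set against a candidate tree then amounts to calling `displays_quartet` once per quartet.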

The main result of this paper is a sufficient condition for a set of binary
phylogenetic trees to be compatible. We need some more definitions to state
it. The set of interior edges of a phylogenetic tree $T$ is denoted by $E(T)$.
For a set $V$ of phylogenetic trees, we define the excess of $V$, denoted
$\mathrm{exc}(V)$, by

\[ \mathrm{exc}(V) = |L(V)| - 3 - \sum_{T \in V} |E(T)|. \]

Finally, we say that $V$ is excess-free if $\mathrm{exc}(V) = 0$, we define
$\mathrm{Exc}(V) = \{V' \subseteq V : \mathrm{exc}(V') = 0\}$, and we call $V$
slim if $\mathrm{exc}(V') \geq 0$ for every non-empty subset $V' \subseteq V$.
In the next section, we will prove:

Theorem 1.1 Every slim set of binary phylogenetic trees is compatible.

Figure 1: Two phylogenetic trees.

Whether a collection of binary phylogenetic trees is slim depends only on the
leaf sets of the trees involved and not on which phylogenetic tree is chosen
for a fixed leaf set. Nevertheless, Theorem 1.1 is sharp in the sense
that, for every binary phylogenetic tree $T$, there are slim sets of quartets that
define $T$ [6].
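Because the excess depends only on leaf sets, it can be computed without knowing the trees themselves. The following Python sketch does this for small examples, using the fact that a binary phylogenetic tree with n leaves has n - 3 interior edges. The function names `excess`, `is_slim` and `quartet_leaves` are hypothetical, the slimness test is a brute-force enumeration of subsets (exponential time, not an efficient algorithm), and the second example set is made up for illustration.

```python
from itertools import combinations

def excess(leaf_sets):
    """exc = |L(V)| - 3 - sum of |E(T)|, with |E(T)| = |L(T)| - 3 for binary trees."""
    leaf_sets = [set(s) for s in leaf_sets]
    all_leaves = set().union(*leaf_sets)
    return len(all_leaves) - 3 - sum(len(s) - 3 for s in leaf_sets)

def is_slim(leaf_sets):
    """Check exc >= 0 for every non-empty subset (brute force, small inputs only)."""
    leaf_sets = list(leaf_sets)
    return all(excess(sub) >= 0
               for k in range(1, len(leaf_sets) + 1)
               for sub in combinations(leaf_sets, k))

def quartet_leaves(q):
    """Leaf set of a quartet written like 'ad|cf' (single-character leaf names)."""
    return frozenset(q.replace("|", ""))

# The quartet set from the example in the introduction.
example = [quartet_leaves(q) for q in ("ad|cf", "ae|bf", "bd|ce")]
# Hypothetical pair of different quartets on the same four leaves.
conflicting = [quartet_leaves(q) for q in ("ab|cd", "ac|bd")]

print(excess(example), is_slim(example))
print(excess(conflicting), is_slim(conflicting))
```

The first set is excess-free and slim, which is consistent with it being compatible; the second has negative excess, so Theorem 1.1 says nothing about it.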

After proving Theorem 1.1 in Section 2, we will apply it in Section 3
to give a new proof of the known characterization of excess-free definitive
quartet sets. An outline of the original proof is given in [6] while the full
details can be found in [2]. That characterization has been referred to as
"one of the most mysterious and apparently difficult results in phylogenetics"

0- 



2 Proof of Theorem 1.1



We assume that the statement is wrong and that $V$ is a minimal non-compatible
slim set of binary phylogenetic trees where $|L(V)|$ is minimal among
all non-compatible slim sets of binary phylogenetic trees with cardinality $|V|$.
Clearly, we have $|L(T)| \geq 4$ for every tree $T \in V$ and there is no subset
$V_1$ of $V$ with $\mathrm{exc}(V_1) = 0$ and $1 < |V_1| < |V|$ since otherwise
$V_1$ can be replaced by a single binary phylogenetic tree with leaf set $L(V_1)$
that displays $V_1$, contradicting the minimality of $|V|$.

We claim that, for every cherry $\{x,y\}$ of $T$ for some tree $T$ in $V$, there
is another tree $T'$ in $V$ that contains $x$ and $y$ as leaves but not as a cherry.
Assuming that the claim is wrong, let $V'$ be the set of phylogenetic trees
obtained from $V$ by replacing $x$ and $y$ by a new leaf $z$, i.e. if a tree $T \in V$
contains either $x$ or $y$ then this leaf is replaced by $z$ and if $T$ contains a cherry
$\{x,y\}$ then the cherry is replaced by $z$. Clearly, we have $|L(V')| < |L(V)|$
and $V'$ is not compatible since the tree obtained from a phylogenetic tree
displaying $V'$ by replacing $z$ by a cherry $\{x,y\}$ would display $V$. Hence, by
induction hypothesis, $V'$ is not slim. Let $V'_2$ be a subset of $V'$ with negative
excess and let $V_2$ be the subset of $V$ obtained from $V'_2$ by reversing the
identification of $x$ and $y$, i.e. replacing $z$ by a cherry $\{x,y\}$ or a leaf $x$ or $y$
for every tree in $V'_2$ that contains $z$. We have

\begin{align*}
\mathrm{exc}(V_2) &= |L(V_2)| - 3 - \sum_{T \in V_2} |E(T)| \\
&\leq |L(V'_2)| + 1 - 3 - \sum_{T \in V'_2} |E(T)| \\
&= \mathrm{exc}(V'_2) + 1 \leq 0.
\end{align*}

This implies $V_2 = V$ or $|V_2| = 1$, but in both cases we get
$\mathrm{exc}(V'_2) \geq \mathrm{exc}(V_2) \geq 0$, a contradiction. This proves the claim.

We construct a digraph $G_V$ with vertex set $V$ and arc set $A_V$, together with an
arc labeling $\lambda$ that associates a set of two elements of $L(V)$ with every arc
$a \in A_V$, as follows: For every tree $T \in V$, we choose two different cherries
$\{x_1(T), y_1(T)\}$ and $\{x_2(T), y_2(T)\}$ of $T$ and, for $i \in \{1,2\}$, we choose
$T_i(T) \in V$ such that $x_i(T), y_i(T) \in L(T_i(T))$ but $\{x_i(T), y_i(T)\}$ is not a
cherry of $T_i(T)$; such a tree exists by the claim. Then we define
$A_V = \{(T, T_i(T)) : T \in V, i \in \{1,2\}\}$ and
$\lambda((T, T_i(T))) = \{x_i(T), y_i(T)\}$ for all $T \in V$ and $i \in \{1,2\}$.
For ease of notation, we will write $\lambda(T_1,T_2)$ rather than
$\lambda((T_1,T_2))$ for an arc $(T_1,T_2) \in A_V$.



Figure 2: The (in this case well-defined) arc-labeled digraph $G_V$ for
$V = \{q_1, \ldots, q_6\}$ with $q_1 = ab|cd$, $q_2 = ac|ef$, $q_3 = ae|bg$,
$q_4 = bd|gh$, $q_5 = cf|dh$, and $q_6 = eg|fh$.
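The construction of the arc-labeled digraph can be replayed on the six quartets of Figure 2 with a short Python sketch. Quartets are written as strings with single-character leaf names, and the function names `cherries`, `leaf_set` and `build_digraph` are hypothetical. Where the construction allows a choice of target tree, this sketch simply takes the first admissible candidate and reports whether the choice was ever ambiguous; that tie-breaking rule is an assumption of the sketch, not part of the proof.

```python
def leaf_set(q):
    """Leaf set of a quartet written like 'ab|cd'."""
    return frozenset(q.replace("|", ""))

def cherries(q):
    """The two cherries of a quartet written like 'ab|cd'."""
    left, right = q.split("|")
    return [frozenset(left), frozenset(right)]

def build_digraph(quartets):
    """Return (arcs, well_defined); each arc is a triple (source, target, label)."""
    arcs, well_defined = [], True
    for q in quartets:
        for cherry in cherries(q):
            # Targets contain both leaves of the cherry, but not as a cherry.
            candidates = [t for t in quartets
                          if cherry <= leaf_set(t) and cherry not in cherries(t)]
            if not candidates:
                raise ValueError(f"no target for cherry {sorted(cherry)} of {q}")
            well_defined = well_defined and len(candidates) == 1
            arcs.append((q, candidates[0], cherry))
    return arcs, well_defined

FIGURE_2 = ["ab|cd", "ac|ef", "ae|bg", "bd|gh", "cf|dh", "eg|fh"]
arcs, well_defined = build_digraph(FIGURE_2)
for source, target, label in arcs:
    print(source, "->", target, sorted(label))
```

Every quartet has outdegree 2 in the result, so the number of arcs is twice the number of vertices, as used in the counting argument.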

By construction, the outdegree $d^+(T) = 2$ for every tree $T \in V$; thus we
have $|A(G')| \leq 2|V(G')|$ for every subdigraph $G'$ of $G_V$ and equality holds
for $G' = G_V$. Let $G$ be a minimal subdigraph of $G_V$ with $|A(G)| = 2|V(G)|$,
let $V_3$ be the vertex set of $G$ and let $X = L(V_3)$. We define an $x$-coloured
cycle of $G$ to be a cycle $C$ of $G$ such that $x \in \lambda(a)$ for every arc $a$
of $C$. We call $C$ a coloured cycle if there is $x \in X$ such that $C$ is an
$x$-coloured cycle.

Since two different cherries of a binary tree are disjoint,
$\lambda(T_1,T_2) \cap \lambda(T_1,T_3) \neq \emptyset$ implies $T_2 = T_3$. Hence,
every coloured cycle $C$ of $G$ is a directed cycle since otherwise there is a
vertex of outdegree 2 in $C$ but the two outgoing arcs must have disjoint label
sets. This implies that every arc with label set $\{x,y\}$ is contained in at
most one $x$-coloured and in at most one $y$-coloured cycle.

Recall that the cycle space of a graph $H$ with vertex set $V(H)$ and edge set
$E(H)$ is the vector space over the 2-element field $\mathbb{Z}_2$ generated by
the cycles of $H$ (considered as binary vectors indexed by $E(H)$), together with
the symmetric difference as addition. This makes it possible to decide whether a
collection of cycles of $H$ is linearly dependent. The dimension of the cycle
space is called the cyclomatic number $c(H)$, and if $H$ is connected we have
$c(H) = |E(H)| - |V(H)| + 1$. For a digraph, we define the cycle space and the
cyclomatic number to be the ones of the underlying undirected graph.
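Linear dependence in the cycle space can be tested mechanically. In the Python sketch below, each cycle is encoded as an integer bitmask whose bit i is set when edge number i lies on the cycle, so symmetric difference becomes XOR and the rank over $\mathbb{Z}_2$ is found by Gaussian elimination. The function names and the small example graph are assumptions made for this illustration.

```python
def cyclomatic_number(num_vertices, edges):
    """|E| - |V| + (number of connected components), computed with union-find."""
    parent = list(range(num_vertices))

    def find(v):
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    components = num_vertices
    for u, v in edges:
        ru, rv = find(u), find(v)
        if ru != rv:
            parent[ru] = rv
            components -= 1
    return len(edges) - num_vertices + components

def cycle_mask(edge_indices):
    """Encode a cycle as a bitmask over edge indices."""
    mask = 0
    for i in edge_indices:
        mask |= 1 << i
    return mask

def gf2_rank(vectors):
    """Rank over Z_2 of bitmask vectors, by elimination on the leading bit."""
    pivots = {}  # leading bit position -> basis vector with that leading bit
    for vec in vectors:
        while vec:
            top = vec.bit_length() - 1
            if top not in pivots:
                pivots[top] = vec
                break
            vec ^= pivots[top]
    return len(pivots)

# Hypothetical example: two triangles sharing the edge with index 2.
edges = [(0, 1), (1, 2), (2, 0), (0, 3), (3, 2)]
c1 = cycle_mask([0, 1, 2])      # triangle 0-1-2
c2 = cycle_mask([2, 3, 4])      # triangle 0-3-2
c3 = cycle_mask([0, 1, 3, 4])   # outer cycle 0-1-2-3
print(cyclomatic_number(4, edges), gf2_rank([c1, c2, c3]))
```

A collection of cycles is linearly dependent exactly when its rank is smaller than the number of cycles, as happens for the three cycles in this example.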

We will now show that the number of coloured cycles in $G$ is at most
$c(G) + 1$. This clearly holds if all coloured cycles of $G$ are linearly independent.
Let $S$ be a minimal linearly dependent set of coloured cycles of $G$ and let $G_S$
be the subdigraph of $G$ induced by $V(G_S) = \bigcup_{C \in S} V(C)$. By the
minimality of $S$, $G_S$ is connected and every arc of $G_S$ is contained in at
least two cycles of $S$. Let $(T_1,T_2)$ and $(T_2,T_3)$ be arcs of an $x$-coloured
cycle $C \in S$, let $\lambda(T_1,T_2) = \{x,y\}$ and $\lambda(T_2,T_3) = \{x,z\}$.
We have $y \neq z$ and there are arcs $(T_4,T_2)$ and $(T_2,T_5)$ in $G_S$ with
$y \in \lambda(T_2,T_5)$ and $z \in \lambda(T_4,T_2)$ since there is a $y$-coloured
cycle containing $(T_1,T_2)$ and a $z$-coloured cycle containing $(T_2,T_3)$ in $S$.
Hence, the degree of $T_2$ in $G_S$ as well as the degree of every other vertex of
$G_S$ is at least 4. This implies $|A(G_S)| \geq 2|V(G_S)|$ but we have
$|A(G')| < 2|V(G')|$ for every proper subdigraph $G'$ of $G$. So we must have
$G_S = G$ and $S$ is the set of all coloured cycles of $G$. This means that the
number of coloured cycles in $G$ is at most $c(G) + 1$. Since $G$ is connected
we have $c(G) = |A(G)| - |V(G)| + 1 = 2|V_3| - |V_3| + 1 = |V_3| + 1$ and the
number of coloured cycles in $G$ is at most $|V_3| + 2$.
We conclude the proof by showing that $V_3$ has negative excess, in
contradiction to our assumption that $V$ is slim. For $x \in X$, let
$G_x = (V_x, A_x)$ be the subdigraph of $G$ such that $V_x$ is the set of all trees
in $V_3$ that contain $x$ and $A_x$ is the set of all arcs of $G$ whose label sets
contain $x$. Then all cycles in $G_x$ are vertex-disjoint, so we have
$|V_x| \geq 1 + |A_x| - c_x$ where $c_x$ is the number of cycles in $G_x$. This implies

\[ \sum_{x \in X} |V_x| \geq |X| + 2|A(G)| - |V_3| - 2 = |X| + 3|V_3| - 2. \]

Finally, we get

\begin{align*}
\mathrm{exc}(V_3) &= |X| - 3 - \sum_{T \in V_3} |E(T)| \\
&= |X| - 3 - \sum_{T \in V_3} (|L(T)| - 3) \\
&= |X| - 3 + 3|V_3| - \sum_{x \in X} |V_x| \\
&\leq |X| - 3 + 3|V_3| - |X| - 3|V_3| + 2 \\
&= -1.
\end{align*}
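The counting facts behind the last two displays can be spelled out as follows. This is one reading of the argument rather than part of the original text: $\lambda(a)$ denotes the two-element label set of an arc $a$, $V_3$ is the vertex set of $G$, and the second line assumes that each coloured cycle has exactly one colour, which holds because the label of an arc is a cherry of its tail but not of its head, so two consecutive arcs of a cycle cannot carry the same label set.

\begin{align*}
\sum_{x \in X} |A_x| &= \sum_{a \in A(G)} |\lambda(a)| = 2|A(G)| = 4|V_3|, \\
\sum_{x \in X} c_x &\leq |V_3| + 2, \\
\sum_{T \in V_3} |L(T)| &= \sum_{x \in X} |V_x|.
\end{align*}

The first identity counts each arc once for each of its two labels, the second is the bound on the number of coloured cycles, and the third counts the pairs $(x, T)$ with $x \in L(T)$ in two ways. Together with $|E(T)| = |L(T)| - 3$ for a binary tree $T$, these give the chain of (in)equalities ending in $-1$.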



3 Definitive Quartet Sets of Minimum Size 

Let $T$ be a phylogenetic tree with $\{a, b, c, d\} \subseteq L(T)$ and let
$e \in E(T)$. Then the quartet $ab|cd$ distinguishes $e$ if $T$ displays $ab|cd$
and $e$ is the only edge that the paths from $a$ to $c$ and from $b$ to $d$ in $T$
have in common. We summarize some elementary results about definitive quartet
sets. Proofs can be found in [8].
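The definition of a distinguished edge translates directly into a check on paths. The Python sketch below first tests that the tree displays ab|cd (the a-b path and the c-d path share no vertex) and then intersects the edge sets of the a-c path and the b-d path. The tree representation, the function names and the example tree are assumptions made for this illustration.

```python
def make_tree(edges):
    """Build an adjacency dictionary from a list of undirected edges."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj

def tree_path(adj, s, t, prev=None):
    """Vertices on the unique s-t path of a tree (depth-first search)."""
    if s == t:
        return [t]
    for w in adj[s]:
        if w != prev:
            rest = tree_path(adj, w, t, s)
            if rest:
                return [s] + rest
    return []

def path_edges(adj, s, t):
    """Edges on the s-t path, each as a frozenset of its two end vertices."""
    p = tree_path(adj, s, t)
    return {frozenset(e) for e in zip(p, p[1:])}

def distinguished_edge(adj, a, b, c, d):
    """Edge distinguished by ab|cd, or None if there is no such edge."""
    if set(tree_path(adj, a, b)) & set(tree_path(adj, c, d)):
        return None  # the tree does not display ab|cd
    common = path_edges(adj, a, c) & path_edges(adj, b, d)
    return next(iter(common)) if len(common) == 1 else None

# Hypothetical binary tree with leaves a..f and interior edges pq, qr, rs.
tree = make_tree([("a", "p"), ("b", "p"), ("p", "q"), ("c", "q"),
                  ("q", "r"), ("d", "r"), ("r", "s"), ("e", "s"), ("f", "s")])

print(distinguished_edge(tree, "a", "b", "c", "d"))
print(distinguished_edge(tree, "a", "b", "e", "f"))
```

In this example ab|cd distinguishes the interior edge pq, while ab|ef is displayed but distinguishes no edge because the two paths share all three interior edges.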

Lemma 3.1 Let $Q$ be a quartet set that defines a phylogenetic tree $T$ with
$|L(T)| = n$. Then $T$ is binary and for every interior edge $e$ of $T$, there must
be a quartet in $Q$ that distinguishes $e$. Hence, we have $|Q| \geq n - 3$ and, in
case of equality, every quartet in $Q$ distinguishes an interior edge of $T$.

Combining Theorem 1.1 with Lemma 3.1, we get the following corollary:



Corollary 3.2 Let $Q$ be a quartet set that defines a phylogenetic tree $T$ with
$|L(T)| = n$. If $|Q| = n - 3$, then $Q$ is slim and every excess-free subset of $Q$
is definitive.

Proof: Assume that $|Q| = n - 3$ and $Q$ is not slim. Let $Q'$ be a subset of
$Q$ with negative excess. Since $T|_{L(Q')}$ contains $|L(Q')| - 3$ interior edges but
$Q'$ contains more than $|L(Q')| - 3$ quartets, there must be an interior edge
in $T$ that is not distinguished by a quartet in $Q$, a contradiction to Lemma
3.1. Now assume that an excess-free subset $Q'$ of $Q$ is not definitive. Let $T'$
with $L(T') = L(Q')$ be a different tree from $T|_{L(Q')}$ that displays $Q'$. Then
$Q - Q' + T'$ is slim and compatible in view of Theorem 1.1. However, a tree
that displays $Q - Q' + T'$ displays $Q$ as well but it is different from $T$. Hence,
$Q$ can not be definitive. ∎

For a set of two binary phylogenetic trees, it is easy to decide whether they are
compatible or definitive. The proof of the next lemma is straightforward
and omitted.

Lemma 3.3 Let $V = \{T_1, T_2\}$ be a set of two binary phylogenetic trees with
$L(T_1) \cap L(T_2) \neq \emptyset$. Then $V$ is compatible if and only if there is a
binary tree $T_{1,2}$ with leaf set $L(T_1) \cap L(T_2)$ that is displayed by $T_1$
and $T_2$. $V$ is definitive if and only if, in addition, the sets of edges in
$T_{1,2}$ which are subdivided in $T_1$ and $T_2$, respectively, are disjoint.

For a set $Q$ and a set $\mathcal{C}$ of subsets of $Q$, the pair $(Q, \mathcal{C})$
is called a patchwork if, for every two sets $C_1, C_2 \in \mathcal{C}$ with
$C_1 \cap C_2 \neq \emptyset$, we have $C_1 \cap C_2 \in \mathcal{C}$ and
$C_1 \cup C_2 \in \mathcal{C}$. Patchworks were introduced by Böcker and Dress [5].
An easy proof of the following lemma can be found in [8], p. 136.
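The patchwork condition is a closure property that is easy to test on small set systems. The Python sketch below checks it pair by pair; the function name `is_patchwork` and the two example families are assumptions made for this illustration.

```python
from itertools import combinations

def is_patchwork(family):
    """True if overlapping members always have their union and intersection in the family."""
    fam = {frozenset(s) for s in family}
    return all((a & b) in fam and (a | b) in fam
               for a, b in combinations(fam, 2) if a & b)

# Hypothetical families of subsets of {1, 2, 3}.
closed = [{1}, {2}, {1, 2}, {2, 3}, {1, 2, 3}]
not_closed = [{1}, {1, 2}, {2, 3}, {1, 2, 3}]  # {1,2} and {2,3} overlap in {2}, which is missing

print(is_patchwork(closed), is_patchwork(not_closed))
```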

Lemma 3.4 Let $T$ be a binary phylogenetic tree and let $Q$ be an excess-free
quartet set such that every interior edge of $T$ is distinguished by an element
of $Q$. Then $(Q, \mathrm{Exc}(Q))$ is a patchwork.

Figure 3: Two trees $T_1$ and $T_2$ and the restriction $T_{1,2}$ of both trees to the
intersection of their leaf sets. The dashed edges have to be subdivided to
obtain $T_1$ from $T_{1,2}$ and the dotted edge has to be subdivided to obtain $T_2$.



A set $\mathcal{C}$ of non-empty subsets of a set $Q$ is called a hierarchy of $Q$ if,
for every two sets $C_1, C_2 \in \mathcal{C}$ with $C_1 \cap C_2 \neq \emptyset$, we
have $C_1 \subseteq C_2$ or $C_2 \subseteq C_1$. The elements of a hierarchy are called
clusters. A hierarchy $\mathcal{C}$ of $Q$ is maximal if $Q \in \mathcal{C}$ and, for
every set $C \in \mathcal{C}$ with $|C| > 1$, there are $C_1, C_2 \in \mathcal{C}$ such
that $C = C_1 \cup C_2$ and $C_1 \cap C_2 = \emptyset$. We call $C_1$ and $C_2$ the
children of $C$. This is only one of several different definitions of a maximal
hierarchy and their equivalence is shown in pE]. A patchwork $(Q, \mathcal{C})$ is
called ample if there is a maximal hierarchy $\mathcal{C}'$ of $Q$ with
$\mathcal{C}' \subseteq \mathcal{C}$. The next lemma is a special case of the results
in [5] where ample patchworks are studied in detail.
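Both hierarchy definitions can be checked directly on small set systems. The Python sketch below follows the definitions literally; the function names `is_hierarchy` and `is_maximal_hierarchy` and the example families are assumptions made for this illustration.

```python
from itertools import combinations

def is_hierarchy(family):
    """Non-empty sets such that any two overlapping members are nested."""
    fam = {frozenset(s) for s in family}
    return all(fam) and all(a <= b or b <= a
                            for a, b in combinations(fam, 2) if a & b)

def is_maximal_hierarchy(family, ground):
    """Hierarchy containing the ground set in which every cluster with more
    than one element is the disjoint union of two clusters (its children)."""
    fam = {frozenset(s) for s in family}
    if not is_hierarchy(fam) or frozenset(ground) not in fam:
        return False
    return all(any(a | b == c and not (a & b) for a, b in combinations(fam, 2))
               for c in fam if len(c) > 1)

# Hypothetical families on the ground set {1, 2, 3}.
maximal = [{1}, {2}, {3}, {1, 2}, {1, 2, 3}]
not_maximal = [{1}, {2}, {1, 2, 3}]   # {1,2,3} has no two children
not_nested = [{1, 2}, {2, 3}]         # overlapping but not nested

print(is_maximal_hierarchy(maximal, {1, 2, 3}))
print(is_maximal_hierarchy(not_maximal, {1, 2, 3}))
print(is_hierarchy(not_nested))
```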

Lemma 3.5 Let $Q$ be a finite set and let $(Q, \mathcal{C})$ be an ample patchwork
with $C \in \mathcal{C}$. Then there is a maximal hierarchy $\mathcal{C}_C$ of $Q$ with
$C \in \mathcal{C}_C \subseteq \mathcal{C}$.

Proof: Assume the statement is wrong and $C \in \mathcal{C}$ is maximal such that
there is no maximal hierarchy $\mathcal{C}_C$ of $Q$ with
$C \in \mathcal{C}_C \subseteq \mathcal{C}$. Clearly, we have $C \neq Q$. Let
$\mathcal{C}' \subseteq \mathcal{C}$ be a maximal hierarchy of $Q$. Then
$\mathcal{C}_1 = \{C \cap C' : C' \in \mathcal{C}' \text{ and } C \cap C' \neq \emptyset\}$
is a maximal hierarchy of $C$ with $\mathcal{C}_1 \subseteq \mathcal{C}$. Let
$A \in \mathcal{C}'$ be minimal such that $A \cap C \neq \emptyset$ and
$A \cup C \neq C$ and let $B$ be the child of $A$ with $B \cap C = \emptyset$. Since
$(Q, \mathcal{C})$ is a patchwork we have that $A \cup C \in \mathcal{C}$ and, by
definition, $A \cup C$ is the disjoint union of $B$ and $C$. In view of the
maximality of $C$ there is a maximal hierarchy $\mathcal{C}'' \subseteq \mathcal{C}$
of $Q$ with $A \cup C \in \mathcal{C}''$. By replacing all proper subsets of
$A \cup C$ in $\mathcal{C}''$ by the union of $\mathcal{C}_1$ and the maximal
hierarchy of $B$ contained in $\mathcal{C}'$, we obtain a maximal hierarchy
$\mathcal{C}_C$ of $Q$ with $C \in \mathcal{C}_C \subseteq \mathcal{C}$,
in contradiction to our assumption.



Combining Lemma 3.4 and Lemma 3.5 yields the following result.

Corollary 3.6 Let $Q$ be a set of quartets with $C \in \mathrm{Exc}(Q)$. If
$\mathrm{Exc}(Q)$ contains a maximal hierarchy of $Q$, then $\mathrm{Exc}(Q)$ contains
a maximal hierarchy $\mathcal{C}_C$ of $Q$ with $C \in \mathcal{C}_C$.

We are now able to establish the hardest implication of the characterization
of excess-free definitive quartet sets.

Theorem 3.7 If an excess-free set $Q$ of quartets defines a binary phylogenetic
tree $T$, then $\mathrm{Exc}(Q)$ contains a maximal hierarchy of $Q$.

Proof: To simplify the notation, we call a maximal hierarchy of a subset
$Q_s \subseteq Q$ that is contained in $\mathrm{Exc}(Q)$ a $Q_s$-hierarchy.

Assume that $Q$ is a minimal counterexample where $\{w_1, x_1\}$ is a cherry
of $T$ and $w_1x_1|w_2y \in Q$. Note that $Q$ is slim in view of Corollary 3.2. Then
there is a subset $Q'$ of $Q - w_1x_1|w_2y$ such that $Q'$ contains $w_2$ and $y$ as
leaves and $\mathrm{exc}(Q') = 0$ since otherwise, by Theorem 1.1, the quartet set
obtained from $Q$ by identifying $w_2$ and $y$ would be compatible and thus there
would be a phylogenetic tree displaying $Q$ that contains a cherry $\{w_2, y\}$.
However, $\{w_2, y\}$ is not a cherry of $T$ since $w_1x_1|w_2y$ distinguishes an edge
of $T$.

For every subset $Q_s$ of $Q - w_1x_1|w_2y$, we let $Q_s^*$ denote the quartet set
obtained from $Q_s$ by replacing $w_1$ and $x_1$ by a new leaf $z_1$ and we define
$Q^* = (Q - w_1x_1|w_2y)^*$. Conversely, for $Q_s^* \subseteq Q^*$, we use $Q_s$ to
denote the quartet set obtained from $Q_s^*$ by reversing the replacement. Note
that $Q_s$ is well defined since $Q$ is excess-free and defines $T$. Hence, every
quartet in $Q$ distinguishes an interior edge of $T$ and every interior edge of $T$
is distinguished by exactly one quartet. Clearly, $Q^*$ defines the tree $T^*$
obtained from $T$ by replacing the cherry $\{w_1, x_1\}$ by $z_1$. Further, since we
have $|Q^*| \leq |L(T^*)| - 3$ and Lemma 3.1 implies $|Q^*| \geq |L(T^*)| - 3$, $Q^*$
is excess-free; thus, by induction hypothesis, $(Q^*, \mathrm{Exc}(Q^*))$ is an ample
patchwork.
Since $\mathrm{exc}(Q') = 0$ and $Q$ is slim we have $\mathrm{exc}(Q'^*) = 0$; thus,
by Corollary 3.6, there is a $Q^*$-hierarchy that contains $Q'^*$. Hence, one child
$Q_1^*$ of $Q^*$ contains $w_2$ and $y$. Let $Q_2^*$ be the other child of $Q^*$. We
note that $|L(Q_1^*) \cap L(Q_2^*)| = 3$.

We will now consider the case $z_1 \in L(Q_1^*) \cap L(Q_2^*)$. Let
$L(Q_1^*) \cap L(Q_2^*) = \{z_1, z_2, z_3\}$. Then $Q_1 + w_1x_1|w_2y$ is excess-free
and, in view of Corollary 3.2, definitive. By induction hypothesis, there is a
$(Q_1 + w_1x_1|w_2y)$-hierarchy $\mathcal{C}_1$. Since $Q$ is definitive,
$Q_2 + w_1x_1|z_2z_3$ is definitive because every tree that displays
$Q_2 + w_1x_1|z_2z_3$ and the tree defined by $Q_1 + w_1x_1|w_2y$ are compatible
in view of Lemma 3.3. Therefore, there is a $(Q_2 + w_1x_1|z_2z_3)$-hierarchy
$\mathcal{C}_2$. Let $\mathcal{C}'_2$ be the set system that is obtained from
$\mathcal{C}_2$ by replacing every cluster $C$ that contains $w_1x_1|z_2z_3$ by
$(C - w_1x_1|z_2z_3) \cup (Q_1 + w_1x_1|w_2y)$. Then
$\mathcal{C}_1 \cup \mathcal{C}'_2$ is a $Q$-hierarchy, contradicting our assumption.
Hence, we have $z_1 \notin L(Q_1^*) \cap L(Q_2^*)$.

If $z_1 \in L(Q_1^*)$ and $z_1 \notin L(Q_2^*)$ then
$\mathrm{exc}(Q_1 + w_1x_1|w_2y) = \mathrm{exc}(Q_2) = 0$; thus there is a
$Q$-hierarchy where $Q_1 + w_1x_1|w_2y$ and $Q_2 = Q_2^*$ are the children of $Q$.
Hence, we have $z_1 \in L(Q_2^*)$ and $z_1 \notin L(Q_1^*)$. We can assume that
$w_2 \notin L(Q_2^*)$ since in case $w_2, y \in L(Q_2^*)$ interchanging $Q_1^*$ and
$Q_2^*$ would yield $\{w_2, y, z_1\} \subseteq L(Q_1^*)$, a case that we have already
considered. If $y \notin L(Q_2^*)$ then let
$L(Q_1^*) \cap L(Q_2^*) = \{v_1, v_2, v_3\}$. Lemma 3.3 implies that different edges
of the phylogenetic tree with leaf set $\{v_1, v_2, v_3\}$ are subdivided in
$T|_{L(Q_1)}$ and $T|_{L(Q_2)}$ but then $w_1x_1|w_2y$ does not distinguish an edge of
$T$, a contradiction to Lemma 3.1.

The only remaining case is $w_2, y \in L(Q_1^*)$, $z_1 \notin L(Q_1^*)$,
$y, z_1 \in L(Q_2^*)$, and $w_2 \notin L(Q_2^*)$. Let
$L(Q_1^*) \cap L(Q_2^*) = \{y, x_2, w_3\}$. In view of Lemma 3.3, the edge sets in
the phylogenetic tree with leaf set $\{y, x_2, w_3\}$ that have to be subdivided
to obtain $T|_{L(Q_1)}$ respectively $T|_{L(Q_2)}$ are disjoint. On the other
hand, $\{w_1, x_1\}$ is a cherry of $T|_{L(Q_2)}$ while $w_2$ is in $L(Q_1)$. Since
$w_1x_1|w_2y$ distinguishes an edge of $T$ it follows that $w_2y|x_2w_3$ is not
displayed by $T$.
Hence, we can assume without loss of generality that $T$ displays $w_2x_2|w_3y$.
Further, we have $Q_1 = Q_1^*$; thus there is a $Q_1$-hierarchy. Let $T_2$ be the
tree defined by $Q_2$. Then the edge of the phylogenetic tree with leaf set
$\{x_2, w_3, y\}$ that is incident with $x_2$ does not have to be subdivided to obtain
$T_2$. Therefore, $Q_2 + w_2x_2|yw_3$ is definitive. Further, the quartet set
$Q'_2 := Q_2 + w_2x_2|yw_3 + w_1x_1|w_2y$ is definitive since every tree that displays
$Q'_2$ is compatible with the tree defined by $Q_1$ in view of Lemma 3.3, and $Q'_2$
is excess-free. If $|Q_1| > 1$, then $|Q'_2| < |Q|$; thus, by induction hypothesis,
there is a $(Q_2 + w_1x_1|w_2y + w_2x_2|w_3y)$-hierarchy and we can construct a
$Q$-hierarchy in the same way as for the case $z_1 \in L(Q_1^*) \cap L(Q_2^*)$.
Hence, $Q_1 = \{w_2x_2|w_3y\}$; thus $\{w_2, x_2\}$ is a cherry of $T$ and
$w_1x_1|w_2y$ and $w_2x_2|w_3y$ are the only two quartets in $Q$ which contain $w_2$.

We have shown that, for every cherry $\{w, x\}$ of $T$ with $wx|w'y' \in Q$, there
is $z' \in \{w', y'\}$ such that the following statements hold:

(P1) $z'$ is contained in a cherry $\{z', x'\}$ of $T$.

(P2) There are exactly two quartets in $Q$ which contain $z'$.

(P3) There is $w'' \in L(T)$ such that $w'x'|w''y' \in Q$ (if $z' = w'$) or
$x'y'|w'w'' \in Q$ (if $z' = y'$).

We will use Properties (P1)-(P3) of $Q$ and $T$ to show that, for every natural
number $i$, the following statements hold:

(P4) There is a leaf set $L_i = \{w_j : 1 \leq j \leq i + 1\} \cup \{x_j : 1 \leq j \leq
i\} \cup \{y\} \subseteq L(T)$ of cardinality $2i + 2$ such that $T|_{L_i}$ is the
phylogenetic tree $T_i$ with vertex set $L_i \cup \{u_j, v_j : 1 \leq j \leq i\}$ and
edge set $\{yu_1\} \cup \{u_jv_j, v_jw_j, v_jx_j, u_ju_{j+1} : 1 \leq j \leq i\}$ where
we define $u_{i+1} = w_{i+1}$.

(P5) $\{w_i, x_i\}$ is a cherry of $T$.

(P6) $w_ix_i|w_{i+1}y \in Q$.

This will complete the proof of the theorem since it contradicts the finiteness
of $T$.
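The tree $T_i$ described in (P4) is a caterpillar, and its edge list can be generated mechanically. The Python sketch below builds it and checks that it has $2i + 2$ leaves and $2i$ interior vertices of degree 3. The string vertex names (such as `w1`, `u2`, `y`) and the function names `caterpillar` and `degrees` are assumptions made for this illustration.

```python
def caterpillar(i):
    """Edge list of the tree T_i from (P4), with vertices named by strings."""
    def u(j):
        # By definition, u_{i+1} is identified with the leaf w_{i+1}.
        return f"w{j}" if j == i + 1 else f"u{j}"

    edges = [("y", u(1))]
    for j in range(1, i + 1):
        edges += [(u(j), f"v{j}"), (f"v{j}", f"w{j}"),
                  (f"v{j}", f"x{j}"), (u(j), u(j + 1))]
    return edges

def degrees(edges):
    """Degree of every vertex that appears in the edge list."""
    deg = {}
    for a, b in edges:
        deg[a] = deg.get(a, 0) + 1
        deg[b] = deg.get(b, 0) + 1
    return deg

i = 3
deg = degrees(caterpillar(i))
leaves = sorted(v for v, d in deg.items() if d == 1)
interior = sorted(v for v, d in deg.items() if d == 3)
print(leaves)
print(interior)
```

For $i = 1$ the construction yields the quartet $w_1x_1|w_2y$, which matches the starting point of the induction.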

We have already shown that all claimed statements hold for $i \leq 2$. Assume
$i \geq 3$ and that all claimed statements are true for smaller $i$. Since,
by induction hypothesis, $\{w_{i-1}, x_{i-1}\}$ is a cherry of $T$ in view of (P5) and
$w_{i-1}x_{i-1}|w_iy \in Q$ in view of (P6), (P1) implies that there is a leaf $x_i$ of
$T$ and $z \in \{w_i, y\}$ such that $\{z, x_i\}$ is a cherry of $T$. Since
$T|_{L_{i-1}} = T_{i-1}$ we have $x_i \notin L_{i-1}$. By (P2), there are exactly two
quartets in $Q$ which contain $z$. Since $y$ is contained in the quartets
$w_1x_1|w_2y$ and $w_2x_2|w_3y$ and there is a quartet containing $x_i$ and $y$ if
$\{x_i, y\}$ is a cherry of $T$, we can not have $z = y$. Hence, $z = w_i$ and (P3)
implies that there is $w_{i+1} \in L(T)$ with $w_ix_i|w_{i+1}y \in Q$. Since
$w_ix_i|w_{i+1}y$ distinguishes an edge of $T$ we have $T|_{L_i} = T_i$. ∎

The characterization of excess-free defining quartet sets, which was originally
published in [6], is an easy consequence of Theorem 3.7.

Corollary 3.8 Let $T$ be a binary phylogenetic X-tree and let $Q$ be an excess-free
quartet set displayed by $T$ with $L(Q) = L(T)$. Then $Q$ defines $T$ if and
only if (i) every interior edge of $T$ is distinguished by a quartet in $Q$ and
(ii) $\mathrm{Exc}(Q)$ contains a maximal hierarchy.

Proof: If $Q$ defines $T$ then (i) follows from Lemma 3.1 and (ii) from Theorem
3.7. Conversely, we use induction on $|L(T)|$ to show that $Q$ defines $T$ if (i)
and (ii) hold. This is clearly true if $|L(Q)| = 4$; thus we can assume that
$|L(Q)| \geq 5$ and there is a maximal hierarchy contained in $\mathrm{Exc}(Q)$ where
$Q_1$ and $Q_2$ are the children of $Q$. By induction hypothesis, $Q_1$ and $Q_2$
define binary trees $T_1$ and $T_2$, respectively. For $i \in \{1, 2\}$, every quartet
in $Q_i$ distinguishes an interior edge of $T$. Hence, the interior edges of $T_i$
are not subdivided in the smallest subtree of $T$ containing $L(T_i)$. Therefore,
the interior edges of $T$ distinguished by a quartet in $Q_i$ induce a connected
subgraph $T'_i$ of $T$, which implies that $T'_1$ and $T'_2$ have exactly one vertex
$v$ in common. Let $L(T_1) \cap L(T_2) = \{x, y, z\}$. By symmetry, we can assume that
$\{x, y\}$ is a cherry of $T_1$ and $z$ is an isolated vertex in the graph obtained
from $T_2$ by removing the path from $x$ to $y$. Hence, only the edge of the
phylogenetic tree with leaf set $\{x, y, z\}$ that is incident with $z$ has to be
subdivided to obtain $T_1$ while one or both of the other edges have to be
subdivided to obtain $T_2$. Therefore, Lemma 3.3 implies that $T_1$ and $T_2$ define
$T$; thus $Q$ defines $T$. ∎

We conclude the paper by pointing out one of the algorithmic consequences
of Corollary 3.8 that are studied in [3].

Corollary 3.9 It can be decided in polynomial time if an excess-free quartet
set $Q$ is definitive and then the phylogenetic tree displaying $Q$ can be
reconstructed in polynomial time.